智能论文笔记

Fully complex-valued deep learning model for visual perception

Aniruddh Sikdar , Sumanth Udupa , Suresh Sundaram

分类：计算机视觉

2022-12-14

Deep learning models operating in the complex domain are used due to their rich representation capacity. However, most of these models are either restricted to the first quadrant of the complex plane or project the complex-valued data into the real domain, causing a loss of information. This paper proposes that operating entirely in the complex domain increases the overall performance of complex-valued models. A novel, fully complex-valued learning scheme is proposed to train a Fully Complex-valued Convolutional Neural Network (FC-CNN) using a newly proposed complex-valued loss function and training strategy. Benchmarked on CIFAR-10, SVHN, and CIFAR-100, FC-CNN has a 4-10% gain compared to its real-valued counterpart, maintaining the model complexity. With fewer parameters, it achieves comparable performance to state-of-the-art complex-valued models on CIFAR-10 and SVHN. For the CIFAR-100 dataset, it achieves state-of-the-art performance with 25% fewer parameters. FC-CNN shows better training efficiency and much faster convergence than all the other models.

translated by 谷歌翻译

Fully Complex-valued Fully Convolutional Multi-feature Fusion Network (FC2MFN) for Building Segmentation of InSAR images

Aniruddh Sikdar , Sumanth Udupa , Suresh Sundaram , Narasimhan Sundararajan

分类：计算机视觉

2022-12-14

Building segmentation in high-resolution InSAR images is a challenging task that can be useful for large-scale surveillance. Although complex-valued deep learning networks perform better than their real-valued counterparts for complex-valued SAR data, phase information is not retained throughout the network, which causes a loss of information. This paper proposes a Fully Complex-valued, Fully Convolutional Multi-feature Fusion Network(FC2MFN) for building semantic segmentation on InSAR images using a novel, fully complex-valued learning scheme. The network learns multi-scale features, performs multi-feature fusion, and has a complex-valued output. For the particularity of complex-valued InSAR data, a new complex-valued pooling layer is proposed that compares complex numbers considering their magnitude and phase. This helps the network retain the phase information even through the pooling layer. Experimental results on the simulated InSAR dataset show that FC2MFN achieves better results compared to other state-of-the-art methods in terms of segmentation performance and model complexity.

translated by 谷歌翻译

Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification

Sumanth Udupa , Aniruddh Sikdar , Suresh Sundaram

分类：计算机视觉

2022-12-14

Object detection and classification using aerial images is a challenging task as the information regarding targets are not abundant. Synthetic Aperture Radar(SAR) images can be used for Automatic Target Recognition(ATR) systems as it can operate in all-weather conditions and in low light settings. But, SAR images contain salt and pepper noise(speckle noise) that cause hindrance for the deep learning models to extract meaningful features. Using just aerial view Electro-optical(EO) images for ATR systems may also not result in high accuracy as these images are of low resolution and also do not provide ample information in extreme weather conditions. Therefore, information from multiple sensors can be used to enhance the performance of Automatic Target Recognition(ATR) systems. In this paper, we explore a methodology to use both EO and SAR sensor information to effectively improve the performance of the ATR systems by handling the shortcomings of each of the sensors. A novel Multi-Modal Domain Fusion(MDF) network is proposed to learn the domain invariant features from multi-modal data and use it to accurately classify the aerial view objects. The proposed MDF network achieves top-10 performance in the Track-1 with an accuracy of 25.3 % and top-5 performance in Track-2 with an accuracy of 34.26 % in the test phase on the PBVS MAVOC Challenge dataset [18].

translated by 谷歌翻译

Computationally Light Spectrally Normalized Memory Neuron Network based Estimator for GPS-Denied operation of Micro UAV

Nishanth Rao , Suresh Sundaram , Varun Raghavendra

分类：机器人

2022-11-12

This paper addresses the problem of position estimation in UAVs operating in a cluttered environment where GPS information is unavailable. A model learning-based approach is proposed that takes in the rotor RPMs and past state as input and predicts the one-step-ahead position of the UAV using a novel spectral-normalized memory neural network (SN-MNN). The spectral normalization guarantees stable and reliable prediction performance. The predicted position is transformed to global coordinate frame which is then fused along with the odometry of other peripheral sensors like IMU, barometer, compass etc., using the onboard extended Kalman filter to estimate the states of the UAV. The experimental flight data collected from a motion capture facility using a micro-UAV is used to train the SN-MNN. The PX4-ECL library is used to replay the flight data using the proposed algorithm, and the estimated position is compared with actual ground truth data. The proposed algorithm doesn't require any additional onboard sensors, and is computationally light. The performance of the proposed approach is compared with the current state-of-art GPS-denied algorithms, and it can be seen that the proposed algorithm has the least RMSE for position estimates.

translated by 谷歌翻译

An Efficient Approach with Dynamic Multi-Swarm of UAVs for Forest Firefighting

Josy John , K. Harikumar , J. Senthilnath , Suresh Sundaram

分类：机器人

2022-11-03

In this paper, the Multi-Swarm Cooperative Information-driven search and Divide and Conquer mitigation control (MSCIDC) approach is proposed for faster detection and mitigation of forest fire by reducing the loss of biodiversity, nutrients, soil moisture, and other intangible benefits. A swarm is a cooperative group of Unmanned Aerial Vehicles (UAVs) that fly together to search and quench the fire effectively. The multi-swarm cooperative information-driven search uses a multi-level search comprising cooperative information-driven exploration and exploitation for quick/accurate detection of fire location. The search level is selected based on the thermal sensor information about the potential fire area. The dynamicity of swarms, aided by global regulative repulsion and merging between swarms, reduces the detection and mitigation time compared to the existing methods. The local attraction among the members of the swarm helps the non-detector members to reach the fire location faster, and divide-and-conquer mitigation control ensures a non-overlapping fire sector allocation for all members quenching the fire. The performance of MSCIDC has been compared with different multi-UAV methods using a simulated environment of pine forest. The performance clearly shows that MSCIDC mitigates fire much faster than the multi-UAV methods. The Monte-Carlo simulation results indicate that the proposed method reduces the average forest area burnt by $65\%$ and mission time by $60\%$ compared to the best result case of the multi-UAV approaches, guaranteeing a faster and successful mission.

translated by 谷歌翻译

Control Barrier Functions in UGVs for Kinematic Obstacle Avoidance: A Collision Cone Approach

Phani Thontepu , Bhavya Giri Goswami , Neelaksh Singh , Shyamsundar P I , Shyam Sundar M G , Suresh Sundaram , Vaibhav Katewa , Shishir Kolathaya.

分类：机器人

2022-09-23

在本文中，我们提出了针对无人接地车辆（UGV）的新的控制屏障功能（CBF），该功能有助于避免与运动学（非零速度）障碍物发生冲突。尽管当前的CBF形式已经成功地保证了与静态障碍物的安全/碰撞避免安全性，但动态案例的扩展已获得有限的成功。此外，借助UGV模型，例如Unicycle或自行车，现有CBF的应用在控制方面是保守的，即在某些情况下不可能进行转向/推力控制。从经典的碰撞锥中汲取灵感来避免轨迹规划，我们介绍了其新颖的CBF配方，并具有对独轮车和自行车模型的安全性保证。主要思想是确保障碍物的速度W.R.T.车辆总是指向车辆。因此，我们构建了一个约束，该约束确保速度向量始终避开指向车辆的向量锥。这种新控制方法的功效在哥白尼移动机器人上进行了实验验证。我们将其进一步扩展到以自行车模型的形式扩展到自动驾驶汽车，并在Carla模拟器中的各种情况下证明了避免碰撞。

translated by 谷歌翻译

Decentralized Distributed Expert Assisted Learning (D2EAL) approach for cooperative target-tracking

Shubhankar Gupta , Suresh Sundaram

分类：机器人

2022-09-22

本文解决了使用异质多机器人系统进行合作目标跟踪的问题，该系统在该系统上通过动态通信网络进行通信，而异质性则是在机器人中安装的不同类型的传感器和预测算法方面。该问题被投入到分布式学习框架中，在该框架中，机器人被认为是通过动态通信网络连接的“代理”。他们的预测算法被认为是“专家”，对目标轨迹的看法预测。在本文中，提出了一种新颖的分散分布式专家辅助学习（D2EAL）算法，提出了通过使每个机器人通过其信息共享来改善目标轨迹的外观预测，并运行加权信息，从而改善了整体跟踪性能。融合过程结合了基于预测损失度量的在线学习权重。对D2EAL进行了理论分析，该分析涉及对累积预测损失的最坏情况界限的分析以及权重分析。仿真研究表明，在涉及专家预测中涉及大动态偏见或漂移的不利场景中，D2EAL优于众所周知的基于协方差的估计/预测融合方法，无论是在预测性能和可伸缩性方面。

translated by 谷歌翻译

An Input-Output Feedback Linearization based Exponentially Stable Controller for Multi-UAV Payload Transport

Nishanth Rao , Suresh Sundaram

分类：机器人

2022-07-10

在本文中，提出了一个稳定稳定的轨迹跟踪控制器，用于多uav有效载荷运输。多uav有效负载系统在无人机和有效负载框架的垂直刚性链接之间具有2DOF磁球接头，因此无人机可以自由滚动或自由投球。这些垂直链接紧密地连接到有效载荷上，无法移动。为完整的有效载体 - uav系统得出了输入输出反馈线性化模型以及有效载荷轨迹跟踪的推力矢量控制。关于跟踪控制定律的理论分析表明，控制定律是指数稳定的，从而确保了沿期望轨迹的安全运输。为了验证拟议的控制定律的性能，提供了数值模拟以及高保真凉亭实时仿真的结果。接下来，针对两种实际情况分析了提议的控制器的鲁棒性：有效载荷和有效载荷质量不确定性的外部干扰。结果清楚地表明，所提出的控制器在实现指数稳定的轨迹跟踪的同时具有稳健性和计算效率。

translated by 谷歌翻译

Offline Text-Independent Writer Identification based on word level data

Vineet Kumar , Suresh Sundaram

分类：计算机视觉

2022-02-21

本文提出了一种新的方案，以根据个人的手写输入单词图像来识别文档的作者身份。我们的方法是与文本无关的，并且对所考虑的输入单词图像的大小没有任何限制。首先，我们采用SIFT算法在不同级别的抽象（包括字符的特征或组合）上提取多个关键点。然后，这些关键点通过训练有素的CNN网络，以生成与卷积层相对应的特征图。但是，由于比例对应于SIFT密钥点，生成的特征映射的大小可能会有所不同。为了缓解此问题，将梯度的直方图应用于特征图上以产生固定表示。通常，在CNN中，每个卷积块的过滤器数量增加，具体取决于网络的深度。因此，为每个卷积特征图提取直方图特征增加了尺寸以及计算负载。为了解决这一方面，我们使用基于熵的方法来学习算法的训练阶段中特定CNN层的特征图的权重。我们提出的系统的功效已在两个公开数据库中证明，即CVL和IAM。我们从经验上表明，与以前的作品相比，获得的结果是有希望的。

translated by 谷歌翻译

Robust EMRAN-aided Coupled Controller for Autonomous Vehicles

Sauranil Debarshi , Suresh Sundaram , Narasimhan Sundararajan

分类：机器人

2021-06-22

本文介绍了一种用于自主车辆的耦合，神经网络辅助纵向巡航和横向路径跟踪控制器，具有模型不确定性和经历未知的外部干扰。使用反馈误差学习机制，采用利用自适应径向基函数（RBF）神经网络的反向车辆动态学习方案，称为扩展的最小资源分配网络（EMRAN）。 EMRAN使用扩展的卡尔曼滤波器进行在线学习和体重更新，并采用了一种越来越多的/修剪策略，用于维护紧凑的网络，以便更容易地实现。在线学习算法处理参数化不确定性，并消除了未知干扰在道路上的影响。结合用于提高泛化性能的自我调节学习方案，所提出的EMRAN辅助控制架构辅助基本PID巡航和斯坦利路径跟踪控制器以耦合的形式。与传统的PID和斯坦利控制器相比，其对各种干扰和不确定性的性能和鲁棒性以及与基于模糊的PID控制器和主动扰动抑制控制（ADRC）方案的比较。慢速和高速场景介绍了仿真结果。根均线（RMS）和最大跟踪误差清楚地表明提出的控制方案在未知环境下实现自动车辆中更好的跟踪性能的有效性。

translated by 谷歌翻译